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(57) Abstract 



A DNA sequence encoding human |l-casein (a human milk protein) and analogues of said DNA sequence which encodes 
a polypeptide having the calcium binding activity of human |i-casein or opioid aaivity or angiotension converting enzyme (ACE) 
inhibitory activity. The polypeptide encoded by said DNA sequence or analogues may be used in the production of recombinant 
human (i-casein either by means of a prokaryotic or an eukaryotic produaion system, or more advantageously by means of pro- 
duction in transgenic non-human mammals >uch as cows. The recombinant human |)-casein is a useful constituent of infant for- 
mulae used for feeding infants as a substitute for human milk. i.e. maternal milk. The recombinant human P-casein makes it pos- 
sible to prepare an infant formula having a closer similarity to human milk and thus an improved nutritional and biological 
value . — — 
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HUMAN 3-CASEIN, PROCESS FOR PRODUCING IT A.MD USE THEREOF. 

The presenr invencion relates to a DNA sequence encoding Che human 
milk prorein ^- casein. More specifically, the DNA sequence encodes a 
poiypepcide having the amino acid sequence shovm in Figure 1 below. 
5 The DNA sequence. is advantageously used in the production of recombi- 
nant huaian ^-casein, either by means of a prokaryotic or an eukaryo- 
Cic production system, or more advantageously by means of production 
in transgenic non-human mamraals such as cows. The main use of the 
recombinant human d-casein is as a constituent of infant formulae 
10 used for feeding infants as a substitute for human milk. When used as 
a constituent of infant formulae, it is contemplated that the recom- 
binant human 5-casein prc^-idts a substantial improvement of the 
nutritional ai-.d biological value of the formulae in that a closer 
simiLaritv to hunian milk is obtained. 
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BACKGROUND OF THE irr.'E.^TIO.S 



It is well knovn chat human milk- feeding is considered superior to 
formula- feeding for infants. Not only does human milk provide a well- 
balanced supply of nutrients, but it is also easily digested by the 
20 infant. Thus, several biologically active components which are known 
to have physiological functions in the infant are either a constitu- 
ent of human milk or produced during the digestion thereof, including 
components involved in the defense against infection and components 
facilitating the uptake of nutrients from human milk. 

25 In spite of the great efforts which have been invested in preparing 
infant formulae-, it has not been possible to produce a formula which 
to any substantial extent has the advantageous properties of human 
milk. Thus, infant formula, often prepared on the basis of cow milk, 
is generally incompletely digested by the infant and is lacking sub- 

30 stances known to have effect on the physiological functions of the 

infant. In order to obtain an infant formula with a nutritional value 
similar to human milk, a niuaber of additives including protein frag- 
ments, vitamins, minerals etc., which are normally f rmed r taken up 
during the infant's digestion of human milk, are included in the for- 

2 5 mula with the consequent risk of posing an increased strain on and 



I 



2 

possible lonjj- i-eiin damage of CaiiL oi^uwa iuch a= liver and kid- 

ney. Another disadvantage associated with the use of cow milk-based 
formulae is the increased risk for inducing allergy in the infant 
against bovine pr teins. 

5 As an alternative to cow milk-based infant formulae, human milk 

obtainable from so-called milk banks has been used. However, feeding 
newborn infants with human milk from milk banks has in the recent 
years to an increasing extent been avoided, because of the fear for 
the presence of infective agents such as HIV and CMV in human milk. 
10 In order to destroy the infective agents in human milk it has become 
necessary to pasteurize the milk before use. However, by pasteuriza- 
tion the nutritional value and the biological effects of the milk 
components are decreased and human milk is used to a still lesser 
extent . 

15 Human milk differs markedly from that of other mammalian species, 

including cows, in that it contains a lower over-all protein content 
and lower ratio of casein/whey as well as a different protein compo- 
sition. For instance, the casein subclasses of human milk comprises 
only fl-casein and /c-casein, whereas the bovine casein subclasses are 

20 Q-casein, 5-casein, and K-casein (Miller et al . ) . Also the amino acid 
sequences of human milk protein differ from that of other mammalian 
milk proteins . 

^-Casein is a phosphorylated protein which is present in milk^ of 
several species including humans in which it is the major casein 

25 subunic. This protein - or its digested fragments -is believed to 

enhance calcium absorption by chelating calcium to its phosphorylated 
residues and thereby keeping it in an absorbable form. Human ^-casein 
is easily digested by newborn infants and the digestive products have 
been found to play an important role in the calcium uptake, and thus 

3C in the mineralization of the skeleton. A digestion product (^-caso- 

morphin) of human ,5 -casein has been found to have opioid activity and 
may be involved in the sleeping patterns of breast-fed infants. 

It would be desirable to he able to prepare an infant formula with a 
composition closer to chat of human milk and thus avoid the ab ve 
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disadvantages associaced with bovine milk-based infant formula, e.g. 
a formula comprising human milk proteins. However, this w uld require 
chat human milk proteins are obtainable in large quantities. Although 
human milk proteins may be purified directly from human milk, this is 
5 not a realistic and sufficiently economical way to obtain the large 
quantities needed for large scale formula production, and other 
methods mus.t be developed before an infant formula comprising htiman 
milk proteins may be prepared. 



So far. little detailed characterization of human milk proteins, 
10 e.g. in terms of amino acid sequences, has been made. Greenberg ec 

al . reported the isolation and purification of native human ^-casein 
and the amino acid sequence thereof. The total composition of Che 
native human /9-casein was seated to be 212 amino acids and various 
phosohorvlation sites were identified in the sequence. 

i 

15 Several milk protein genes, primarily from rodents or dairy animals, 
have been cloned and sequenced, cf. references 11-21 numbered in the 
List of References below, but knowledge of Che genes encoding human 
milk proteins is still sparse. Recently, Menon and Ham disclosed the 
isolation and sequencing of a parcial cDNA clone encoding human 0- 
20 casein. The clone included the coding sequence corresponding Co amln 
acid residues 146-212 of the mature protein and the encire 3' non- 
coding region. The deduced partial amino acid sequence was compared 
III with the sequence of the native protein reported by Greenberg ec al. 

and several amino acid differences were found between Che tfwo amino 
25 acid sequences. Menon and Ham have not, however, succeeded in isola- 
ting the entire cDNA encoding human ^-casein, nor in providing a DNA 
sequence useful for the production of a polypeptide comprising a 
substantial part (of more than 67 amino acid residues) of the human 
;9-casein sequence. This is accomplished for Che firsc Cime by Che 
30 present inventors (Lonnerdal et al, published on 20 August, 1990, 
which is hereby incorporated by reference) . 
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BRIEF DESCRIPTION OF THE I .WENT I Cv: 



Ic is an object of the presenc invention zo provide a means for pro- 
ducing recombinant human ;9-casein in a high yield and ac a realistic 
price . 

Accordingly, in one aspect the present invention relates to a DNA 
sequence encoding a polypeptide comprising the amino acid sequence 
shown in Figure 1 or 

an analogue of said DNA sequence which 

1) hybridizes with the DNA sequence shown in Figure 1 or a 
specific part thereof under the stringent hybridization condi- 
tions or 

2) encodes a polypeptide, the amino acid sequence of which is at 
least 852 homologous with the amino acid sequence shown in 
Figure 1, or 

3) constitutes an effective subsequence of said DNA sequence, 

which encodes a polypeptide having the calcium binding activity of 
human 5-casein, or having opioid activity, or having angiotensin 
converting enzyme (ACE) inhibitory activity, or a polypeptide having 
a combination of any of these activities. ^ 

The DNA seque.nce of the invention was determined on the basis of a 
cDNA clone isolated from a human mammary gland cDNA library using 
hybridization with a specific ^2-mer oligonucleotide probe prepared 
on the basis of amino acid residues 117-130 of the human /9-casein 
sequence as disclosed by Greenberg et al . The procedure used for 
isolating the human ^-casein cDNA sequence is outlined in Example 1 
below . 

The stringent hybridization conditions referred to above are to be 
understood in their conventional meaning, i.e. that hybridization is 
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carried out at 67 °C in ^xS^C ami final vashing ac SyC in IxSSC using 
Che method specified in the "Definition" part of the Examples below. 



[The term "homologous" is used here to illustrate the degree of iden- 
tity between the amino acid sequence of a given polypeptide and the 
amino acid sequence shown in Figure i. The amino acid sequence to be 
compared with the amino acid sequence shown in Figure 1 may be dedu- 
ced from a DNA sequence, e.g. obtained by hybridization as defined 
above , or may be obtained by conventional amino acid sequencing 
methods. The degree of homology is preferably determined on Che amino 
acid sequence of a mature polypeptide, i.e. without taking any leader 
.sequence into consideration. It is preferred that the degree of 

omology is at least 90%, such as 95X or even 98Z with the amino acid 

equence shown in Figure 1. 



The tern: "effective subsequence" as used above refers co a subsequen- 
ce which encodes a peptide being at least partially functional with 
respect cc the activities of h'jman p-casein as defined above. The 
subsequence may be the result of a truncation at either end of the 
DY'A sequence or of the removal of one or more nucleotides or nucleo- 
tide sequences wichin DN'A sequence. Preferably, when encoding a 
peptide having the opioid or the ACE- inhibitory accivity of human 0' 
casein, the effective subsequence comprises at least 15 nucleotides 
such as at least 20 nucleotides. 



The terr. "calcium-binding activity" denotes the capability bf the 
polypeptide of the invention to bind calcium and may be determined by 
equilibriuro dialysis or a similar technique. The term "opioid activi- 
ty" denotes the peptide's opiate-like effects and capability of the 
peptide to bind to opiate receptors (opiate receptor affinity). The 
"opioid accivity" is determined as disclosed by Brantl, 1984 and 
xMiglior i - Samour et al. The term "ACE- inhibitory activity" denotes the 
capability.- of the peptide to inhibit the angiotensin converting 
en2'.TT.e (ACE) and has important indications for the treatment of heart 
disorders. The ACE- inhibitory effect i"; determined by use f a method 
as disclosed bv Maruvama et al. and Kohir.ura et al . 



In this c.onnecLioii , it should be noce.ri rhat the terms "calcium-bind- 
ing activity", "opioid activity" and "ACE- inhibitory activity" and 
related terms should be understood to be qualitative and/ r quantita- 
tive that is, relating first of all to the nature of the activity, 
such as the nature of the calcium binding activity, and/or t the 
level of activity of the polypeptide as determined with reference to 
human ^-casein. Concerning the ACE- inhibitory effect or the opiod 
effect, these are also of the same qualitative nature as ascribed in 
the literature to digestive fragments of human ^-casein having ACE- 
inhibitory activity or opioid activity, respectively. 

In this connection the term "digestive fragment" refers to the pep- 
tide fragment (s) which, in nature, are generated during the digestion 
of human yS- case in by the infant fed on human milk. Such fragments may 
be prepared, e.g. by cleavage of recombinant human ^-casein, by 
expression from DNA sequences encoding such fragments, or by use of 
conventional peptide synthesis. 

In another aspect the present invention relates to a polypeptide 
produced by the DNA sequence of the invention, preferably a recom- 
binant human ^-casein protein having the amino acid sequence shown in 
Figure 1 or a variant thereof having the calciiim- binding activity of 
human ^-casein or a subsequence of the amino acid sequence shown in 
Figure 1 or the variant having ACE- inhibitory activity or opioid 
activity of a digestive fragment of human ^-casein. The variant and 
the subsequence is further defined below. I 

In yet another aspect the present invention provides a mammalian 
expression system comprising a DNA sequence encoding human ^^casein 
inserted into a gene encoding a milk protein of a non-human mammal so 
as to form a hybrid gene which is expressible in the mammary gland of 
an adult female of a mammal harbouring said hybrid gene so that human 
;5-casein is produced when the hybrid gene is expressed. 

In yet a further aspect, the present invention relates to a method of 
producing a transgenic non-human mammal capable of expressing hvunan 
fl-casein, comprising injecting a mammalian expressi n system as de- 
fined above into a fertilized egg or a cell of an embryo of a mammal 
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so as "o incorporate rhe expression system into the germline of the 

maiiur.al and developing the resulting injected fertilized egg or embryo 
into an adult female mammal. 

DETAILED DESCRIPTION OF THE PRESENT IiV^/EKTION 

5 The cDNA sequence shovm in Figure 1 has an overall length of 1065 bp, 
including the poly(A) tail. The open reading frame begins from che 
first nucleotide "A" at the 5' -end and codes for a signal peptide of 
15 aitino acids and the mature casein consisting of 210 amino acid 
residues. The coding cDNA of hvrnian 5-casein is flanked by one 390 bp 

D non-coding region at the 3 '-terminal end and one of 48 bp at the 5'- 
tcrminus. The size of the human 3-c sein cDNA shown in Figure 1 is 
similar to that of ovine ^-casein (Frovot et al.). 

The smino acid sequence deduced from the nucleotide sequence shown in 
Figure 1 shovs some discrepancies compared to that reported by Green- 

5 berg et al . Thus, the deduced amino acid sequence is constituted by 
210 amino acid residues and no codons for the amino acids at posi- 
tions 19 and 207 of the 212 amino acid sequence reported by Greenberg 
et al . are found. Other discrepancies are found at positions 15 (Thr 
instead of Pro). 32 (Gly vs. Thr), 34 (Glu vs. Gin), 104 (Ser vs. 

:> Gin), 133 (Leu vs. Ser). 158 (Gin vs , Glu), 167 (Gin vs. Glu), 169 
(Val \'s. Leu). 173 (Gin vs. Val) , 192 (Thr vs. Pro). 198 (Thr vs. 
Pro). 19? (Glr. vs. Glu). and 201-206 ( Leu-Ala-Pro-Val-His -Asn vs . 
Ser-Thr-7hr-Glx-Ala-Asz-His) . 

.A rslati/eiy large degree of homology, 45-622, is found between human 
5 ^-casein and the corresponding proteins from other species. Further- 
more, the 15 residues signal peptide of human ^-casein is identical 
to those of bovine, ovine and rabbit fl-caseins and, except for 1 re- 
sidue, identical ro those of rat and mouse 5 -caseins. By comparison 
between the D!.'.-\ sequences encoding .5-casein of different origins, 
0 single base substitutions betveen species, which usually occur at the 
wobble position and thus code for the same amino acid, have been 
found . 



A consensus polyadenylaticn recognition signal AAUAAA is located 16 
nucleotides upstream from the poly(A) tail. An 11 nucleotides long 
motif (bp 823-833; TTTATTTATTT) which might be involved in the stabi- 
lization of the mRNA, is also found and corresponds to a sequence 
5 found in connection with ^-casein genes of other origins (Provot et 
al.). 

An interesting DNA sequence translatable into a human fl-casein poly- 
peptide is a sequence comprising a human o-casein gene. Accordingly, 
in a further aspect, the present invention relates to a DNA sequence 
10 comprising a human ^-casein gene or an effective subsequence thereof 
containing elements capable of expressing a polypeptide having the 
activity of human ;3-casein or a digestive fragment thereof, or an 
analogue of said DNA sequence which 

1) hybridizes with the DNA sequence shown in Figure 1 or a 

15 specific part thereof under stringent hybridization conditions 

or 

2) encodes a polypeptide, the amino acid sequence of which is at 
least 85% homologous with the a.T.ino acid sequence shown in 
Figure 1 . or 

20 3) constitutes an effective subsequence of said DNA sequence, 

which encodes a polypeptide having the calcium binding activity of 
human yS -casein, or opioid activity, or angiotensin converting enzyme 
(ACE) inhibitory activity or a combination of any two or three of 
these activities. 

25 In the present context, the term "gene" is used to indicate a DNA 

sequence which is involved in producing a polypeptide chain and which 
includes regions preceding and following the coding region (5' -up- 
stream and 3' -downstream sequences) as well as intervening sequences, 
the so-called introns , which are placed between individual coding 

30 segments (so-called exons) or in the 5' -upstream or 3 ' -downstream 

region. The 5' -upstream region comprises a regulatory sequence which 
controls the expression of the gene, typically a promoter. The 3'- 
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downstrsam regicn comprises sequences which are involved in termina- 
tion of transcription of the gene and optionally sequences resp n- 
sible for polyadenylation of the transcript and the 3' untranslated 
region. The term "effective subsequence" of the gene is to be under- 
5 stood in the same manner as defined above in connection with the DNA 
sequence . 

The hybridization may be carried out as described in the "Definition" 
part of the Examples below, preferably on the basis of a probe com- 
prising the coding part of the DNA sequence shown in Figure 1 below. 
J The terms "homologous" and "effective subsequences" are used in a 
similar manner as that defined above. 



Preferably, the polypeptide encode > by the analogue of the DNA se- 
quence is at least 902 homologous, such as at least 95X or even 98X 
homologous with the amino acid sequence shown in Figure 1. 

15 The ^-casein gene or part thereof may be isolated and sequenced using 
the procedure described in Example IC below. In Example IB, different 
genetic variants of the human ^-casein gene are discussed. 

An example of a specific analogue of the DNA sequence of the inven- 
tion is a DNA sequence which comprises the DNA sequence shown in 

20 Sequence Listing 2 and particularly adapted for expression in E. 

coli. This DNA sequence is one which, when inserted in E. coli to- 
gether with suitable regulatory sequences, results in the 'expressi n 
of a polypeptide having the amino acid sequence shown in Figure 1. 
Thus, this DNA sequence comprises specific codons recognized by E. 

25 coli. The preparation of this DNA sequence is described in Example 2. 



As mentioned above , the DNA sequence shown in Figure 1 encodes a 
polypeptide comprising the functional domain/domains of human 0- 
casein as well as the signal peptide naturally associated therewith. 
While the presence of a signal peptide in most cases is a prerequi- 
site for allowing the polypeptide expressed from the DNA sequence to 
be transport ;=d out of the cell in which it is produced, the nature 
and origin of the particular signal peptide to be used may vary and 



need not l-r: tlie i.iHiiHi i nacuraii'.- associated wirh the human 

/S-casein. 

In accordance herewith, a particularly interesting DNA sequence of 
the invention is a DNA sequence which encodes a polypeptide compris- 
ing amino acids 1-210 shown in Figure 1. i.e. the amino acids corre- 
sponding to the mature human fl-casein. 

Human p- case in is highly phosphorvlated at serine and threonine 
residues close to the N- terminal end. and this phosphorylated part of 
fi-casein is believed to give the molecule its capacity to bind cal- 
cium and thus to participate in micelle formation. Thus, the N-termi- 
nal part of the molecule is highly important in terms of the biologi- 
cal activity of 5-casein. In accordance herewith, an important sub- 
sequence of the DNA sequence of the invention is a DNA sequence which 
at least encodes the first part of the amino acid sequence shown in 
Figure 1. especially the part comprising amino acids 1-12. However, 
also other parts of the molecule mav be important in connection with 
the calcium-binding activity of human p-casein. 

Further interesting subsequences of the nucleotide and deduced amino 
acid sequences shown in Figure 1 are discussed in the following. A 
nucleotide sequence encoding a heptapeptide (VPYPQRA) expected to 
have .nCE- inhibitory activity is found at amino acid residues 167-173 
of the amino acid sequence shown in Figure 1. The expected ACE- inhi- 
bitory activity is based on the fact that a similar ^-casein peptide 
having such activity and comprising 6 of the above listed 7 amino 
acid residues has been found to have ACE- inhibitory activity (Maru- 
yama et al.'. In this connection, also the pentapeptide corresponding 
to positions 16S-L72 of the amino acid sequence shown in Figure 1 is 
conterr.piated to have ACE- inhibitory activity, which pencapepCide in 
nature is contemplated to be a cleavage product of the heptapeptide 
above, .-.iso a peptide constituted by a part of the amino acids 39-52 
of the sequence shown in Figure 1, preferably a heptapeptide, a hexa- 
peptide or a heptapeptide, is contemplated to have ACE- inhibitory 
activitv. This expectation is based on the result of an analysis of 
the .ACE - inhibitory activity of synthetic peptides having a similar 
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amino acid sequence as pcj.LiJea luu-uU l-.i L;ie above ciced region of 
human fl-casein (Kohmura e- al. ) 

Peptides having opioid acriviry and immune stimulatory activity may 
be determined on "he basis cf the disclosure of Brantl. 

5 In a further aspect, the present invention relates to a modified DNA 
sequence which differs from a DNA sequence of the invention as defi- 
ned above in that at lease one nucleotide has been deleted, substitu- 
ted or modified or at least one additional nucleotide has been in- 
serted so as to result in a DNA sequence which encodes a polypeptide 
0 having a calcium-binding activity which is similar to. Increased or 
decreased as compared to the calcium-binding activity of htiman fi- 
casein. Other interesting modif icatio;. j result in peptides having a 
opioid activity or ACE- inhibitory activity. 

The polypeptide encoded by the modified DNA sequence normally has an 
amino acid sequence which is different from the amino acid sequence 
of the human 5-casein. It will be understood that a modified DNA 
sequence cf the invention will be of importance in the preparation of 
novel polypeptides having a modified activity as compared Co human fi- 
casein or digestive fragments thereof or other similarly important 
activities . 

U"nen "substitution" is performed, one or more nucleotides in the full 
nucleotide sequence are replaced with one or more different nucleo- 
tides: when "addition" is performed, one or more nucleotides are 
added at either end of the full nucleotide sequence; tfhen "insertion" 
is performed one or more nucleotides within the full nucleotide 
sequence is inserted: and when "deletion" is performed one or more 
nucleotides are deleted from the full nucleotide sequence whether at 
either end of the sequence or at any suitable point within it. 

A modified DNA sequence may be obtained by well-known methods, e.g., 
by use of site -directed mutagenesis. 

An example of an important modified DN.A sequence of the invention is 
a DNA sequence in which additional codons encoding serine or threo- 



12 

nine residues have been inserted so as to result in a modified DKa. 
sequence encoding a polypeptide having an increased number of resi- 
dues CO be phosphorylated. The additional residues may be inserted 
either by being added at either end or within a DNA sequence f the 
invention or by replacing one or more non-serine or non- threonine 
codons present in a DNA sequence of the invention. A polypeptide 
encoded by such a modified DNA sequence is contemplated to have a 
higher degree of phosphorylation and thus an increased calcium-bind- 
ing activity as compared to native human ^-casein. The polypeptide 
produced from such a modified DNA sequence may be used as a nutrition 
supplement for populations which need to increase their calcium 
uptake, e.g. premature infants, women and old people. 

Similarly, when a reduced calcivim uptake is of interest, a modified 
DNA sequence of the invention encoding a polypeptide which has a 
lower number of residues to be phosphorylated than the polypeptide, 
the amino acid sequence of which is shown in Figure 1, is of inte- 
rest. Here, one strategy would be to remove one or more codons for 
serine or threonine residues or replace one or more of such codons by 
non-serine or non- threonine residues. 

Another example of an interesting modified DNA sequence is a DNA 
sequence which encodes the amino acid sequence of a naturally occur- 
ring human /3-casein variant having an amino acid sequence different 
from the one shown in Figure 1 such as the variant disclosed by 
Greenberg et al. discussed above or other genetic variants as discus- 
sed in Example IB. For this purpose, site-directed mutagenesis would 
be carried out using specific oligonucleotide probes conferring an 
exchange/removal of the relevant amino acid residues. 

Another important use of a DNA sequence of the invention as defined 
above is in the preparation of a fusion protein comprising on the one 
hand a polypeptide comprising the amino acid sequence shown in Figure 
1 or an analogue or subsequence thereof as defined above and on the 
other hand a polypeptide of another origin, e.g. a polypeptide or 
peptide part of another milk protein, e.g. a human milk protein such 
as Q- lactalbumin, or a non-human milk protein such as a bovine or 
ovine milk protein such as bovine ^-casein. The fusion protein may be 
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prepared by fusing a DNA sequence of che invencion with a DNA sequen- 
ce encoding che other part of the fusion protein and the proper regu- 
latory sequences in a manner which allows the expression of che 
fusion protein to occur. 



The DNA sequences of the invention explained herein may comprise 
natural as well as synthetic DNA sequences, the natural sequence 
typically being derived directly from cDNA or genomic DNA, normally 
of mammalian origin, e.g. as described below. A synthetic sequence 
may be prepared by conventional methods for synthetically preparing 
DNA molecules, e.g. using the principles solid or liquid phase pep- 
tide svTithesis. Of course, also the DNA sequence may be of mixed 
cDNA and genomic, mixed cDNA and synthetic and mixed genomic and 
synthetic origin. 

The terms "sequence", "subsequence", "analogue" and "polypeptide" as 
used herein with respect to sequences, subsequences, analogues and 
polypeptides according to the invention should of course be under- 
stood as not comprising these phenomena in their natural environment, 
but rather, e.g., ir. isolated, purified, in vitro or recombinant 
form. When reference is made to a DNA sequence of the invention this 
should be understood to include "analogues", "subsequences" and 
"modified sequences" as defined above. Similarly, when reference is 
made to "a polypeptide of the invention" this should be understood to 
include any of the polypeptides defined in the following. 

I 

In another important aspect, the present invention relates to a 
polypeptide encoded by a DN.A sequence of the invention as defined 
above. A particularly interesting polypeptide of the invention is a 
recombinant human 5-casein polypeptide comprising the amino acid 
sequence shown in Figure 1 or a subsequence thereof having the cal- 
cium-binding activity of human ;9-casein. An example of an important 
polypeptide comprising an important subsequence of said amino acid 
sequence is a polypeptide ccmprising amino acid residues 1-210 of che 
amino acid sequence shown in Figure 1 corresponding to the mature 
recombinant human 5-=asein without a signal peptide. 



ct iDCTiTt iTc cucr— 



As it wi.! i 'r>(i apparent rrom tne aoove disclosure, anocher interesting 
polypeptide of the present invention is one which differs fr m a 
polypeptide comprising the amino acid sequence shown in Figure 1 in 
chat at least one amino acid residue has been substituted with a dif- 
ferent amino acid residue and/or in that at least one amino acid re- 
sidue has been deleted or added so as to result in a polypeptide com- 
prising an amino acid sequence being different from the amino acid 
sequence shown in Figure 1 and having a similar, increased or decrea- 
sed calcium-binding activity as compared to the calciura-bindlng acti- 
vity of human ^-casein or having ACE- inhibitory activity or opioid 
activity. Examples of a strategy for designing and preparing modified 
polypeptides of the invention are apparent from the above disclosure. 

The polypeptide of the present invention is one in which at least one 
amino acid residue has been modified by post- translational modifica- 
tion and is preferably in glycosylated and/or phosphorylaCed form. 
Normally, glycosylation is achieved when the polypeptide is expressed 
by a cell of a higher organism such as yeast or preferably a mammal, 
whereas phosphorylation is achieved also in the case of expression in 
lower organisms such as bacteria. The amino acid residues of. the 
polNTseptide of the invention which are normally phosphorylated are 
mentioned above. Glycosylation is normally found in connection with 
amino acid residues Asn, Ser, Thr or hydroxy lysine . 

In a further aspect, the present invention relates to a replicable 
expression vector which carries and is capable of mediating the s 
expression of a DNA sequence encoding human ^-casein. 

In the present context, the term "replicable" means that the vector 
is able to replicate in a given type of host cell into which it has 
been introduced. Immediately upstream of the human ^-casein DNA 
sequence there may be provided a sequence coding for a signal pep- 
tide, the presence of which ensures secretion of the human ^-casein 
expressed by host cells harbouring the vector. The signal sequence 
mav be the one naturally associated with the human ^-casein DNA 
sequence or of another origin. 
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The vector may be any vector './hich scy conveniently be subjected to 
recombinant DNA procedures, and the choice of vector will often 
depend on the host cell into which it is to be introduced. Thus, the 
vector nay be an autonomously replicating vector. I.e. a vector which 
5 exists as an extrachromosomal entity, the replication of which is 

independent of chromosomal replication; examples of such a vector are 
a plasmid, phage, cosmid, mini -chromosome or virus. Alternatively, 
the vector may be one which, when introduced in a host cell, is 
integrated in the host cell genome and replicated together with the 

iO chromosome (s) into which it has been integrated. Examples of suitable 
vectors are a bacterial expression vector, e.g. as exemplified in 
Example 2, and a yeast expression vector, e.g. as exemplified in 
example 3. The vector of the invention may carry any of the DNA 
sequences of the invention as defined above and be used for the 

1: expression of any of the polypeptides of the invention defined above. 
However, when a bacterial vector is concerned it is preferred that 
the DKA sequence encoding human ^-casein comprises the sequence shown 
m Sequence Listing 2, i.e. the sequence specifically adapted to 
expression in bacterial cells. 

20 The present invention further relates to a cell harbouring a repli- 

cable expression vector as defined above. In principle, this cell may 
be of any type of cell, i.e. a prokaryotic cell such as a bacterium, 
e.g. E. coll, a unicellular exikaryotic organism, a fungus or yeast, 
e.g. Ssccharomyces cerevisLae or a cell derived from a. multicellular 

25 organise., e.g. a mammal. The mammalian cells are especially suitable 
for the purpose and are further discussed below. 

In another important aspect, the invention relates to a method of 
producing recombinant human /9-casein, in which a DNA sequence encod- 
ing human casein is inserted in a vector which is able to replicate 

30 in a specific host cell, the resulting recombinant vector is intro- 
duced into a host cell which is grown in or on an appropriate culture 
Tiedium under appropriate conditions for expression of human ^-casein 
and the hu-nan ^-casein is recovered. The medium used to gr w the 
cells may be any conventional medium suitable for the purp s . A 

25 suitable vector may be any f the vectors described above, and an 

appropriate host cell may be any f the cell types listed ab ve. The 
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advantage thac large yields of recorr.binar.r human 5-casein are obcain- 
able ac reasonable coses and. especiallv when Che non-human mamnal is 
a cow, chac che recombinant human 5-caseir. is produced in milk which 
is che normal constituent of. e.g.. infant formulae so that no exten- 

5 sive purification is needed when the recombinant human j8-casein is Co 
be used as a nutrient supplement in milk-based products. Furthermore, 
production in a higher organism such as a non-human mammal normally 
leads to the correct processing of the raanunalian protein, e.g. with 
respect to post - translational processing as discussed above and 

0 proper folding. Also large quantities of substantially pure human fi- 
casein mav be obtained. 



Accordingly, in a further important aspect, the pre-ent invention 
relates to a mammalian expression svstern comprising a DNA sequence 
encoding human 5-casein inserted into a gene encoding a milk protein 
of a non-huir.an mammal so as to form a h^.'brid gene which is expres- 
sible in the mammary gland of an adult female of a mammal harbouring 
said hybrid gene. 

The DNA sequence encoding human 5-casein is preferably a DNA sequence 
as defined above encoding a polypeptide comprising the amino acid 
sequence sho'»-n in Figure 1 or a human 5-casein gene or an analogue or 
effective subsequence thereof. 

The mammary gland as a tissue of expression and genes encoding milk 
proteins are generally considered to be particularly suitable' for use 
in the production of heterologous proteins in transgenic non-human 
mamnals as milk protains are naturally produced at high expression 
levels in the mammary gland. Also, milk is readily collected and 
available in large quantities. In the present connection the use of 
milk protein genes in the production of recombinant human ^-casein 
has the further advantage that it is produced under conditions simi- 
C lar to the its natural production conditions in terms of regulation 
of expression and production location (the m.amraary gland). 

In the present .context che terra "hybrid gene" denotes a DNA sequence 
comprising on the one hand a DN'A sequence encoding human ^-casein as 
defined above and on the other hand a DN.A sequence of the milk prote- 



18. 

in ger.e which is capable of mediating Che expression of che hybrid 
gene produce. The term "gene encoding a milk protein" den tes an 
entire gene as well as an effective subsequence thereof capable of 
mediating and targeting the expression of che hybrid gene to the 
5 tissue of interest, i.e. the mammary gland. Normally, the effective 
subsequence is one which at least harbours one or more of a promoter 
region, a transcriptional start site, 3' and 5' non-coding regions 
and structural sequences. The DNA sequence encoding human ^-casein is 
preferably substantially free from prokaryotic sequences, such as 
10 vector sequences, which may be associated with the DNA sequence 
after, e.g., cloning thereof. 

The hybrid gene is preferably formed by inserting in vitro the DNA 
sequence encoding human ^-casein into the milk protein gene by use of 
techniques known in che art. Alternatively, the DNA sequence encoding 
15 human 5-casein can be inserted in vivo by homologous recombinantion. 

Normally, the DNA sequence encoding human ^-casein will be inserted 
in one of the first exons of the milk protein gene of choice or an 
effective subsequence thereof comprising the first exons and prefe- 
20 rably a substantial part of the 5' flanking sequence which is be- 
lieved to be of regulatory importance. 

The hybrid gene preferably comprises a sequence encoding a signal 
peptide so as to enable the hybrid gene product to be secreted cor- 
rectly into the mammary gland. The signal peptide will cypicall*y be 

25 the one normally found in the milk protein gene in question or one 
associated with the DNA sequence encoding human /3-casein. However, 
also other signal sequences capable of mediating the secretion of the 
hybrid gene product to the mammary gland are relevant. Of course, the 
various elements of the hybrid gene should be fused in such a manner 

30 as to allow for correct expression and processing of the gene pro- 
duct. Thus, normally the DNA sequence encoding the signal peptide of 
choice should be precisely fused to the N- terminal part of the DNA 
sequence encoding human ^-casein. In the hybrid gene, che DNA sequen- 
ce encoding human /9-casein will n rmally comprise its stop codon, but 

35 n t its own message cleavance and polyadenylati n sice. D wnstream of 
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Che DNA scquc—a ancnriirj. human J-.j=aein, Che raRNA processing sequen- 
ces of the milk protein gene will normally be retained. 

A n"-ur.ber of factors are conrer.p laced to be responsible for the actual 
expression level of a particular hvbrid gene. The capability of the 
5 promoter as well of other regularory sequences as mentioned above, 
the integration site of Che expression system in the genome of the 
mammal, the integration site of the DNA sequence encoding human fi- 
casein in the milk protein encoding gene, elements conferring posc- 
transcriptional regulation and other similar factors may be of vital 
10 importance for Che expression level obcained. On the basis of the 

knowledge of the various factors influencing the expression level of 
the hybrid gene, the person skilled in the art would know how to 
design an expression system useful for Che present purpose. 

.A variety of different milk proteins are secreted by the mammary 
15 gland. Two main groups of milk proteins exist, namely the caseins and 
the whey proteins. The composition of milk from different species 
varies qualitatively as well as quantitatively with respect to these 
proteins, .lost non-human mammals produces 3 different types of case- 
in, namely a-casein, a-casein and /c-casein. The most common bovine 
20 whey proteins are a - laccalbumin and ^- lactalbumin. The composition of 
milk of various origins are further disclosed in Clark et al., 1987. 

The milk protein gene to be used may be derived from the same species 
as the one in which the expression system is to be inserted,! or it 
may be derived from another species. In chis connection it has been 
25 shown thac the regulatory elements chat target gene expression to the 
mammary gland are functional across species boundaries (which may be 
due to a possible common ancestor) (Hennighausen ec al.)- 

Examples of suitable genes encoding a milk protein or effective 
subsequences thereof to be used in the construction of an expression 
30 svstem of the invention are normally found among whey proteins of 
various mammalian origins, e.g. a whey acidic protein (WAP) gene, 
preferably of murine origin, and a 5- lactoglobulin gene, preferably 
of ovine origin. Also casein genes of various origins may be found to 
be suitable for the transgenic production of human /8-casein, e.g. 



bovine qSI -casein and rabbir o-^aaeir.. The presencly preterred gene 
is a murine WAP gene as this has been found co be capable of provi- 
ding a high level expression of a number of foreign human proteins in 
milk of different transgenic animals (Hennighausen at al . ) . 

5 Another sequence preferably associated with the expression system of 
the invention is a so-called expression stabilizing sequence capable 
of mediating high-level expression. Strong indications exist that 
such stabilizing sequences are found in the vicinity of and upstream 
of milk protein genes. 



10 



The DNA sequence encoding a human ;3- case in to be inserted in the 
expression system of the invention may be of cDNA, genomic or synthe- 
tic origin or any combination thereof. While some expression systems 
have been found to function best when cDN.A encoding a desirable 
protein is used, others have been found to require the presence of 
15 introns and other regulatory regions in order to obtain a satisfac- 
tory expression (Hennighausen et al.). In some cases it may be advan- 
tageous to introduce genomic structures as polypeptide encoding 
element in vector constructs compared to cDNA elements (Brinster et 
al.). The intron and exon structure may result in higher steady state 
20 mRNA levels that obtained when cDNA based vectors are used. When the 
expression system is based on the WAP gene, a cDNA sequence is pre- 
ferred as it has been verified by various experiments that the WAP 
gene is capable of providing as good an expression of cDNA as of 
genomic DNA. or even a better expression (Hennighausen et al.).'ln 
25 Example 4, an expression system based on the murine WAP gene and a 

cDNA sequence of the invention encoding hu:Tian /9- casein is illustrated 
and further discussed. 



In a further aspect, the present invention relates to a hybrid gene 
comprising a DNA sequence encoding human .5-casein inserted into a 
gene encoding a milk protein of a non-human mammal, the DNA sequence 
being inserted in the milk protein gene in such a manner chac ic is 
expressible in the mammary gland of an adult female of a mammal 
harbouring the hybrid gene. The hybrid gene and its constituents have 
been discussed in detail above. The hybrid gene constitutes an impor- 
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tanc incermediace in the construction of an expression sy&Cem of Che 
invention as disclosed above. 

In another aspect, the present invention relates to a non-hiiman 
mammalian cell harbouring an expression system as defined above. The 
j mamnialian cell is preferably an embryo cell or a pro-nucleus. The 

expression system is suitably inserted in the mammalian cell using a 
method as explained in the following and specifically illustrated in 
Example 4 below. 



In a further important aspect, the present invention relates Co a 
method of producing a transgenic non-human mammal capable of expres- 
sing human ^-casein, comprising injecting an expression system of the 
invention as defined above into a fertilized egg or a cell of an 
embryo of a mammal so as to incorporate the expression syscem inco 
the germline of the mammal and developing the resulting injected 
fertilized egg or embryo into an adult female mammal. 



The incorporation of the expression system into the germline of Che 
mammal may be performed using any suicable cechnique, e.g. as descri- 
bed in "Manipulating the .Mouse Embryo"; A Laboratory Manual, Cold 
Spring Harbor Laboratory Press, 1986. For instance, a few hundred 
molecules of the expression system may be directly injected into a 
fertilized egg, e.g. a fertilized one cell egg or a pro-nucleus 
thereof, or an embryo of the mammal of choice and the microinjecced 
eggs may then subsequently be transferred into the oviducts of pseu- 
dopregnanc foster mothers and allowed to develop. Normally, not all 
of chp injected eggs will develop into adult females expressing human 
3-casein. Thus, about half of the mammals will from a sCaCisCically 
point of view be males from which, however, females can be bred in 
Che following generacions . 

Once integrated in the germ line, the DNA sequence encoding human 
0 casein may be expressed at high levels to produce a correcclv proces- 
sed and funccional human ^-casein in sCable lines of Che mammal in 
quescion . 



Of fi'VLu-ri. inceresi; is a mechod of producing a transgenic non-human 
mammal capable of expressing human 5-casein and substantially in- 
capable of expressing ^-casein from the mammal itself, comprising 

1) destroying the mammalian p-casein expressing capability of Che 
mammal so that substantially no mammalian ^-casein is expressed and 
inserting an expression system of the invention as defined above or a 
DNA sequence encoding human ;3- case in into the germline of the mammal 
in such a manner that human ^-casein is expressed in the mammal 
and/or 

2) replacing the mammalian ^-casein gene or part thereof with an 
expression system of the invention as defined above or a DNA sequence 
encoding human ^-casein. 

The mammalian ;3-casein expressing capability is conveniently destroy- 
ed by introduction of mutations in the DNA sequence responsible for 
the expression of the ^-casein. Such mutations may comprise mutations 
which make the DNA sequence out of frame, or introduction of a scop 
codon or a deletion of one or more nucleotides of the DNA sequence. 

The mammalian 5-casein gene or a part thereof may be replaced with an 
expression system as defined above or a DNA sequence encoding human 
S-casein by use of the well known principles of homologous recombina- 
tion . 

t 

In a further aspect, the present invention relates to a transgenic 
non-human mammal prepared by a method as described above. 

While the transgenic non-human mammal of the invention in its broad- 
est aspect is not restricted to any particular type of mammal, the 
mammal will normally be selected from the group consisting of mice, 
rats, rabbits, sheep, pigs, goats and cattle. For large scale produc- 
tion of human ;9-casein the larger animals such as sheep, goats, pigs 
and especially cattle are normally preferred due to their high milk 
production. However, also mice, rabbits and rats may be interesting 
due to the fact that the manipulation of these animals is more simple 
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and recvlrv in >. i •.>.'< ^iniic animals more quickiv than when, e.g. cac- 
tle, are concerned. 

Also progeny of a transgenic mamaal as defined above, capable of 
producing human ^-casein is vithin the scope of the present invt .\- 
5 tion. 

In a still further aspect, the present invention relates to a cachod 
of preparing human ^-casein comprising collecting milk from a trans- 
genic non-human mammal of the invention as defined above and recover- 
ing the human ^-casein from the milk. The milk may be collecced in 
.0 any suitable manner normally used in connection with the collection 
of milk from the mammal in question. 



From the above explanation it will be clear that the present inven- 
tion for the first time makes it possible to produce milk from a 
non-human mammal comprising human ^-casein, the importance and ucili- 

5 ty of which will be apparent from the present context. Thus, a fur- 
ther aspect of the present invention includes milk from a non-human 
mammal comprising recombinant human ^-casein. Of particular interest 
is milk from a non-human mammal comprising a polypeptide of the 
invention as defined above comprising the amino acid sequence shown 

0 in Figure 1 or a polypeptide encoded by a DNA sequence or an analogue 
or subsequence thereof as defined above. Typically, the milk of the 
invention will be obtained from a transgenic mammal of the invention 
as defined above. ■ 



From the above explanation it will be apparent that an important use 
5 of the polypeptide of the invention is as a nutrient supplement, in 
particular as a substituent of an infant formula. 



In a still further aspect, the present invention relates to an infant 
formula comprising recombinant human case in, in particular a poly- 
peptide of the invention as defined above. The infant formula may be 
0 prepared by adding the recombinant human ^-casein or polypeptide in a 
purified or partly purified form to the normal constituents f the 
infant formula. However, normally it is preferred that the in£ant 
formula is prepared from milk of the invention as defined above, 



especially when it is of bovine origin. The infaiiL formula may be 
prepared using conventional pr cedures and concain any necessary 
additives such as minerals, vitamins etc. 

In a further aspect, the present invention relates to a generally 
novel method of isolating and/or purifying casein proteins from a 
mixture of components including proteins different from casein. The 
method comprises adding to the mixture of components from which the 
casein is to be isolated and/or purified ammonium sulphate In a con- 
centration equal to or of above 0.05 M to precipitate the casein 
proteins, separating the precipitated /3-casein protein from the mix- 
ture of components, and optionally removing salt from the precipi- 
tated casein. 

The purification method of the invention is based on the fact that 
proteins show a decreased solubility in the presence of salts. This 
effect is knovm as "salting out". 

The method is believed to be generally applicable for the Isolation 
of any kind of casein proteins from any kind of mixture, including a 
mixture in which the casein proteins are naturally present, e.g. 
milk, and a mixture in which the casein proteins have been artifi- 
cially added or synthesized, e.g. from a mixture of cells or cell 
components in which recombinant casein has been produced. The cells 
may be microorganism cells, such as bacterial or yeast cells, or 
mammalian cells. t 

Conventionally, casein in milk has been separated by acid precipita- 
tion at its isoelectrical point, e.g. as described by Rowland, 1938. 
However, acid precipitation has certain disadvantages including the 
need of pH-control, which disadvantages are especially pronounced in 
connection with large-scale casein purification. 

Thus it is surprising to find that ammonium sulphate in a concentra- 
tion as low as 0.05 M in the separation of casein from the mixture of 
components, e.g. as described in Example 5, provided a very selective 
casein separation method as compared to conventional methods. The use 
of ammonium sulphate at the low concentration is inexpensive and may 
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easilv be used in large-scale isoLacion and/or puriticacion of case- 
in . 

Alrhough che various cypes of caseins, e.g. a-casein, ^-casein, and 
/c-casein, cannoc be separaced from each other by use of the method 
5 disclosed above, iz is believed that on the basis of the very selec- 
tive casein separation method of the invention, the various types of 
caseins optionally present in this isolated casein precipitate can be 
further separated, if necessary by chromatography. 



More particularly, the casein isolating method of the present inven- 
tion may be used in the isolation of recombinant human ;9-casein from 
a mixture of bacterial or yeast cells in which it is produced, e.g. 
as disclosed above. Thus, in a further aspect, the present invenci.n 
relates to a method of isolating recombinant human ^-casein from 
bacterial or yeast cells as defined above which, in the case of 
intra-cellular expression of recombinant human /9-casein in the bacte- 
rial or yeast cells, comprises separating the cells harboring the 
recombinant human ^-casein from the culture medium, rupturing the 
separated cells so as to allow them to release their concent of incer 
alia recombinant human i-casein, optionally removing cell debris from 
the mixture of ruptured cells and precipitating the released ^-casein 
by addition of ammonium sulphate at a concentration equal to or above 
0.05 M. 

The precipitate is consequently separated from the mixture in which 
it is present, the salt is optionally removed and the ^-casein is 
concentrated, e.g. by freeze drying. 

The separation of the bacterial or yeast cell from the culture medi- 
um, the optional removal of the cell debris, and the separation of 
the precipitate are conveniently carried out by cencrlfugation or by 
sedimentation or by other conventionally used separation methods. 
The rupturing of the bacterial or yeast cells may be accomplished by 
any suitable method such as by sonication, rupture by a freeze/thaw 
treatment, a French Press treatment or the like. 



Auv salL may be removed by conver.cional methods, e.g. by dialysis or 
size-exclusion chromacography . The isolated and optionally purified 
^-casein is preferably concentrated by ultrafiltration and/or ly phi- 
lization . 

The ;9-casein precipitate may advantageously be subjected Co one or 
more wash treatments before it is concentrated, or, in Che event Chat 
the method comprises removal of salt, before this is done. 

When the recombinant human ^-casein has been produced excra-cellular- 
ly, it is in most cases not necessary to rupture the baccerial or 
yeast cells because the recombinant human ^-casein has already been 
excreted into the medium. However, if the extra-cellular expression 
is not satisfactorily effective and a large part of Che produced 
recombinant human ^-casein is present in the cells, it may of course 
be an advantage to rupture such cells. However, normally when excra- 
cellular expression is concerned, the method of isolating recombinanc 
human ^-casein from bacterial or yeast cells as defined above com- 
prises removing the bacterial or yeast cells from the culture medium, 
precipitating the recombinant hionian /3-casein present in Che culture 
medium by addition of ammonium sulphate in a concentration equal Co 
or above 0.05M. separating the precipitated recombinanc human ^- 
casein from the medium, and optionally removing Che sale and subjecC- 
ing Che isolated ^-casein to concentration. 

When recombinant human ^-casein of the invention is to be sep&raced 
from milk produced by a cransgenic non-human mammal as described 
above, the ammonium sulphate is added directly to Che milk and Che 
precipitated casein comprising the recombinanc human casein is re- 
covered from the milk. The casein fraction which may comprise ocher 
types of casein than the recombinant human ^-casein, e.g. bovine 
casein proteins, may be subjected to further purification so as Co 
purify, e.g., the recombinant human 5-casein therefrom, e.g. by 
chromatography or fractionated desalting. The ammonium sulphate 
concencracion used in the purification method of Che invenCion is 
preferably above 0.05 .M, such as equal to or above 0.06 M, e.g. equal 
to or above 0.07 M. However, also higher concentrations such as con- 
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cencracions equal co or above 0.075 M, 0.08 M and ev n 0.09 and 0.1 M 
are contemplated to be suitable for the purpose. 

The invention is further described with reference to the accompanying 
drawings in which 

3 Figure 1 discloses the complete nucleotide and deduced amino acid 
sequences of the human /9- casein cONA fragment obtained as described 
in Excunple 1. The position of the oligonucleotide probe used for 
screening of the cDNA library is shown by underlining. The broken 
arrows show the positions of the different oligodeoxyribonucleoClde 

0 primers used for sequencing of the cDNA fragment, 

Figure 2 the human ^-casein cDNA sequence useful for expression in E. 
coll and cloned as an EcoRI fragment into pUC 19 (as described in 
Example 2) . The cDNA was modified by introduction of syncheclc oligo- 
nucleotides in both the 5' and 3' termini. The modifications altered 
5 the codcn usage in the 5' end, added an excra stop codon Co the 3' 
end and introduced restriction sices which facillCate cloning inco 
the expression vectors, 

Figure 3 the expression vector pS 26 conscructed as described in 
Example 2 and useful for intra-cellular expression of human ^-casein 

0 in £. coll , 

Figure 4 the expression vector pS 28 constructed as describe<d in 
Example 2 and useful for extra-cellular expression of human ^-casein 

in E . coll , 

Figure 5 the result of an SDS-FAGE analysis of soluble /9-casein ex- 
5 pressed in E. coll BL21 (DE3) pLys S performed as described in the 
legend to Figure 7 . 

A) SES-PAGE separation of soluble intracellular proteins of IPTG 
induced E. coll BL21 (DE3) pLys S carrying pS 26. The gel was stained 
with Coomassie Brilliant Blue R-250. 
0 Lane 1; Molecular weight standard 94, 67, 43, 30, 20.2 and 14 kD 
( Pharmacia) . 



Lanes 2-5. SciuDie pioLein IiacLiun L, 2, 3 and 8 pi, respectively. 

B) Western blot of the same fraction as in A. 

Lanes 1-4 contain the same samples as lanes 2-5 in A. 

Lane 5: Prestained molecular weight standard 110, 84, 47, 33, 24 and 
16kD (Bio-Rad). The samples were separated by SDS-PAGE, transferred 
to Immobiline (Millipore) membranes, visualized using polyclonal 
rabbit antibodies produced using highly purified human /9-casein. 
followed by an alkaline phosphatase labelled swine anci -rabbit Ig 
(Dakopatts ) . 

C) SDS-PAGE separation of soluble protein fraction. The gel was 
stained as in A. 

Lane 1: E. coli BL21 (DEB) pLys S uninduced. 
Lane 2: As 1, but IPTG induced. 

Lane 3: E. coll BL2i (DE3) pLys S transformed with pS 28 uninduced. 

Lane 4: As lane 3, but IPTG induced. 

Lane 5: Molecular weight standard as in A, Lane 1. 

Lane 6 : Purified human native ^-casein. 

Figure 6 the expression vector pS 232 used for extra-cellular expres- 
sion of human ^-casein in S. cerevisiae and further described in 
Example 6, 

Figure 7 the result of an Western blot analysis of cell extracts and 
growth medium from yeast cells containing the pS232 plasmid. Uhe 
proteins are separated on a 10-17.52 gradient polyacrylamide gel 
followed by electroblotting to a nitrocellulose filter. Prior to 
hybridization the filter was blocked with 2 . 5X BSA. As a first an- 
tibody purified rabbit polyclonal antibodies against ^-casein were 
used and as a second antibody Dakopatts alkaline phosphatase con- 
jugated affinity purified swine immunoglobulines to rabbit IgG were 
used. The filter was developed with nitro blue tetrazoliiim and 5- 
bromo -4-chloro - 3 - indolylphosphatase . 

Lane 1: .Molecular weight marker 110, 84, 47, 24 and 16kD. 
Lane 2: pS 232A^' 519 25 fil cell lysate. 
Lanes 3 and 4: blank. 

Lane 5: pS 232A'>ri' 519 25 pi culture medium. 
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Lane 6: 5A»g of purified native huir.an p-casein 

Lane T: Purified recombinant: hujcan 5-caseir. produced in E. coli BL21 
(DE3) pLys S containing pS 25. 
Lar.e 8: pYES 2.0/V)C: S19 25 m1 cell lysace . 
5 Lane 9: Blank. 

Lane 10: pYES 2.0/V>ri' 519 25 fil culture medium. 

Figure 8 



r.) the organization of the human S-casein cDNA inserted into the 
aouse WAP gene as described in Example 4, 

B) the expression vector pS 133 used for expression of human ^-casein 

in mice and co:istrucred as described in Example 4, 



Figure 9 the sequence of the Kpnl and Sail fragment generated by PGR 

as described in Example 4, 



Figure 10 -he result of a western blot analysis of milk samples 
derived from three mouse lines transgenic for human ^-casein, carry- 
ing the pS133 derived vector fragment (Example 4). The proteins are 
separated on SDS-PAJE transferred to Imirobilon membranes (Millipore) 
and visualized by polyclonal rabbit antibodies produced using highly 
purified human ^-casein, followed by alkaline phosphatase labelled 
svine anti-rabbit Ig CDakopatts) . In each of the lanes, 10 fil of the 
whev fraction vas loaded. i 



i_ar.i 1 

Lane 2 

_ane 3 

Lane 4 

Lane 5 



pS 133 transgene line 69. 
pS 13 3 transgene line 72. 
p5 133 transgene line 79. 
Non- transgene control line. 

50 ng of recombinant E. coli produced human ^-casein. 



Figure 11 the results obtained after purification of recombinant 
human ^-casein as described in Exam.ple 5. 



A) SDS-PAGE analysis of purified recombinant ^-casein produced in E. 
coll. The gel was stained with Coomassie Brilliant Blue R-250. 
Lane 1: Low molecular weight standard (Pharmacia). 
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Lanes 2-4: Anunoniumsulphace precipitated /9-casein ac increasing 
concentrations . 

B) Western blot analysis of recombinant ^-casein pruified as in A). 
Lanes 1 and 5: Prestained low molecular weight standard (BloRad) . 
5 Lane 2: Native ^-casein purified from human milk. 

Lanes 3 and 4: Recombinant ^-casein purified as in A), and 

Figure 12 the purity of recombinant human ^-casein purified as de- 
scribed in Example 5 and assayed by ion-exchange chromatography. The 
sample was loaded onto the column in 20mM ethanolamine-HCI , 6M urea, 
D pH 9.5 and e luted with a linear gradient up to 0.6M NaCl in the same 
buffer . 



1 



SUBSTITUTE SHEET 
ISA/EP 



wo S--:. ::-!: . • 



PCT/DK9 1/00233 



References 



Brar.tl, V. Novel opioid peptides derived from human ^-casein: Human 
5-casorr.orphins . Eur. J. Pharmacol. 106, 213-214. 1984. 



Brinscer, R.L., Allen, J.M., Behringer, R.R., Gelinas, R.E. and 
5 Paimicer. R.D. Incrcns increase cranscripcional efficiency in trans 
eenic mice. Proc . Nacl. Acad. Sci. USA 85. 836-840, 1988. 



Campbell. S.M., Rosen. J.M., Hennighausen . L.G., Screch-Jurk, U. and 
Sippel. A.E. Comparison of the whey acidic protein genes of the rat 
and couse. Nucleic Acids Res. 12, 8685-8697, 1984. 



10 Clarl. . . Simons, P.. Wilmuc. I. and Lathe, R. Pharmaceuticals 

fror. transgenic livestock. TIBTECH 5. 20-24, 1987. 



Gill. D.R., Hatfuil, G.F. and S.almond, G.P.C. A new cell division 
operon in E. coli. Mcl. Ger,. Ger.et. 205, 134-145, 1986. 

Greenberg, R.. Groves. M.L. and Dovm, H.J. Human /3-casein, amino acid 
15 sequence and identification of phosphorylation sites. J. Biol. Chem. 
25?, 5132-513E. 1984. 



Hennighausen, L.G., Ruiz, L. and Wall, R. Transgenic animals - pro- 
duction of foreign proteins in milk. Curr. Opinions Biotechn. 1, 74 

1 



20 Jicinet-Flores , R.. Kang. Y.C. and Richardson, T. Biochem. Biophus. 
Res. Coimun. 142, 617-621, 1987. 

Koh.T.ura. .M. , Nio, N'.. Kubo, K. , .Minoshima, Y. , Munekata, E. and Ari- 

yoshi . V. Inhibition of Angiotensin-converting enzyme by synthetic 

peptides of human /9-casein. Agric. Biol. Chem. 53. 2107-2114, 1989. 

25 Kunt , C. and Lonnerdal , B. H'oman milk proteins: analysis of casein 

and casein subunits by anion- exchange chromatography, gel electropho- 
resis, and specific staining methods. Am. J. Clin. Nucr. 51, 37-46, 
1990. 



Laemmli. U.K. Nacure 227. 680, 1970. 



Lonnerdal, B. , Bergscrdm, S., Andersson, Y. , Hjalmarsson, K. , Sund- 
qvist, A.K. and Hernell, 0. Cloning and sequencing of a cDNA encoding 
human milk ^-casein. FEES Lett. 269, 153-156, 1990. 

Maruyama, S., Nakagomi , K. , Tomizuka, N. and Suzuki, H. Angiotensin 
I -converting enzyme inhibitor derived from an enzymatic hydrolysate 
of casein. Isolation and bradykinin- potentiating activity on the 
uterus and the ileum of rats. Agric. Biol. Chem. 49, 1405-1410, 1985. 

Menon, R.S. and Ham, R.G. Human ^-casein: partial cDNA sequence and 
apparent polymorphism. Nucleic Acids Res. 17, 2869, 1989. 

Picken, R.N. , Mazaitis , A.J., Maas , W.K. , Rey, M. and Heyneker, H. 
Nucleotide sequence of the gene for heat -stable enterotoxin II of 
E. coli. Infect. Immun. 42. 269-275. 1983. 

Provot, C. , Persuy, M.A. and Mercier. J.C. Complete nucleotide se- 
quence of ovine beta-casein cDNA: inter-species comparison. Biochimie 
71, 827-32, 1989. 

Rowland, S.J. J. Dairy Res. 9, 47-57. 1938. 

( 

Sambrook, J., Fritsch. E.F. and Maniatis , T.E. Molecular cloning, a 
laboratory manual. 2nd ed. Cold Spring Harbor Laboratory Press, 1989. 

Studier, F.W. , Rosenberg, A.H. , Dunn, J.J. and Dubendorff, J.W. Use 
of T7 RNA Polymerase to direct expression of cloned genes. In Methods 
in Enzymology, ed. David V. Goeddel, p. 60-89. Academic Press, 1990. 

Miller. Mark J. S. et al.. Proc. Soc. Exp. Biol. Med. 195, 143-159, 
1990. 



SUBSTITUTE SHEET 
ISA/EP 



wo 93/04171 PCT/DK9 1/00233 

33 

Additional references dealing with cDKA and genes coding for milk 
proteins . 

11 

Bonsing, J. and Mackinlay, A.G. Recent studies on nucleotide sequen- 
5 ces encoding the caseins. J. Dairy Res. 54, 447-61, 1987. 

12 

Hall, L. , Emery, D.C., Davies , M.S., Parker, D. and Craig, R.K. Orga- 
nization and sequence of the human alpha- lactalbumin gene. Biochem. 
J. 242. 735-42, 1987. 

10 13 

Blackburn, D.E., Hobbs , A. A. and Rosen, J.M. Rat beta casein cDNA: 
sequence analysis and evolutionary comparisons. Nucleic Acids Res. 
10, 2295-2307, 1982. 

14 

15 Jones, W.K., Yu-Lee, L.Y. , Clift, S.M., Brown. T.L. and Rosen. J.M. 
The rat casein multigene family. Fine structure and evolution of the 
beta-casein gene. J. Biol. Chem. 260, 7042-7050. 1985. 

15 

Yoshimura, M. . Banerjee, M.R. and Oka, T. Nucleotide sequence of a 
20 cDN.A encoding mouse beta casein. Nucleic Acids Res. 14. 8224, 1986. 

( 

16 

Yoshimura, M. and Oka, T. Isolation and structural analysis of the 
mouse beta-casein gene. Gene 78, 267-275. 1989. 

17 

25 Menon, R.S and Ham, R.G. Hvunan ^S-casein: partial cDNA sequence and 
apparent polymorphism. Nucleic Acids Res. 17, 2869, 1989. 

18 

Stewart. A.F., Bonsing, J., Seattle, C.W., Shah, P., Willis, I.M. , 
.Mackinlay, A.G. Complete nucle tide sequences of bovine alpha 52- and 



34 

beta-casein cDNAs : comparisons with related sequences in other spe- 
cies. Mol. Biol. Evol. 4, 231-41, 1987. 

19 

Provot, C. , Persuy, M.A. and Mercier, J.C. Complete nucleotide se- 
quence of ovine beca-casein cDNA: inter- species comparison. Biochimie 
71. 827-32, 1.989. 

20 

Devinoy, E., Schaerer, E. , Jolivet, G., Fontaine, M-L. , Kraehenbuhl , 
J. P. and Houdebine. L.M. Sequence of the rabbit alpha Sl-casein cDNA. 
Nucleic Acids Res. 16, 11813, 1988. 

21 

Schaerer. E., Devinoy, E. , Kraehenbuihl , J. P. and Houdebine, L.M. 
Sequence of Che rabbit beta- casein cDNA: comparison with other casein 
cDN'A sequences. Nucleic Acids Res. 16, 11814, 1988. 



I 



The following examples are incer.ded ~o illuscr =.ce but not to limit 
the present in\-enticn. 

Cons-ructiyn of the expression systems of the invention, and the 
molecular htiological characterization of it, employs standard methods 
generally known in tha art of recombinant DNA. Unless otherwise 
stated, the methods used are those described by Sambrook et al. , 
1989 . 

DEFINITIO.VS 
Hybridiz^cicr of DSA 

DNA. e.g. present on nitrocellulose filters, are wetted in 2 x SSC 
:i X SSC; G.15 M NaCl , C.0015 M Nai-citrate, pH 7.0] and placed in a 
heat-sealed plastic bag with pre-w.-.rmed (67°C) prehybridization solu- 
tion. Prehybridization takes place for 2 h at 67°C, the bag being 
gently shaken. The solution is exchanged with pre-warmed (67'C) 
hybridization solution, the radioactive probe is added and hybridiza- 
tion is carried out at 67°C for 18 h. The bag is gently shaken to 
ensure constant movemenz of the liquid over the nitrocellulose fil- 
ters. After hybridization, a washing procedure is carried out. 

The radioactive probe is prepared by use of known methods,' e.g. as 
described by Sambrook et al., on the basis of the DNA sequence shown 
in Sequence Listing 1 or a part thereof, especially a coding part 
such as the nucleotides corresponding to amino acids 1-210 or an 
effective subsequence of the DNA sequence as defined above. 

The prehybridization and hybridization solutions used are: 10 x Den- 
hardt'i. 4 x SSC, O.IX SDS . 10 fig/ml polyA. 50 MS/n»l of denatured DNA 
to be analysed and the denatured (heat) radioactive probe. The fil- 
ters are washed in pre-warmed (57 'C) solutions: 10 x Denhardt, 
2 X SSC, 0.1% SDS for 2 x 15 min. and 1 x SSC, O.IZ SDS for A x 15 
T.in. The filters are air-dried and covered with Vita-Wrap, and X-ray 
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film is exposed Co the filters for 3 h to 3 weeks with and without 
intensifying screens. 



EXAMPLE lA 

Cloning and sequencing of cDNA encoding hvunan ^-casein 

5 A A-gt 11 human mammary gland cDNA library prepared from lactational - 
ly competent adult human mammary gland was obtained from Clontech 
Lab., Palo Alto, CA. The clones of the human ;9-casein were screened 
by plaque hybridization using E. coli Y 1090. A synthetic 42-mer 
oligonucleotide probe 

10 5 ' - GAGCAAGGGAAGAGGCAAATGAAGATTTTCAAGATCAGTCAA - 3 ' 

corresponding to amino acids 117-130 in the ^-casein sequence (Green- 
berg et al . , 1984) was synthesized. The construction of this oligo- 
nucleotide was based on a region of the bovine /9-casein amino acid 
sequence having a large degree of homology with the hiunan counter- 

15 parr. However, the selection of the synthetic oligonucleocide in- 
cluded the following modifications of the bovine cDNA sequence (Jimi- 
nez-Flores et al. . 1987) for the nucleotide sequence 449-490: i) nu- 
cleotide 458 (G-C exchange), ii) nucleotide 480 (C-T) , iii) nucleo- 
tide 483 (T-'C) . The oligonucleotide probe was synthesized on a Beck- 

20 man 200A DNA synthesizer using the phophoramidite technique accord- 
ing to the vendor's instructions. The probe was [7--^^P] dATP-labelled 
using T4 polynucleotide kinase (New England Biolabs; Beverly. MA). 
Hybridization was carried out for 12-15 hours at 40''C, and the mem- 
branes were washed and autoradiographed on X-ray film (Amersham, UK). 

25 Six positive plaques were identified in the primary screening. 

Following secondary screening, phage DNA of purified clones were 
isolated from plate lysates. Restriction mapping was performed and 
the ^-casein cDNA was localized in the cloned fragment by Southern 
blotting . 



3.7 

One of cne A-gc 11 clones carrying an insert: hybridizing to the 0- 
casein ii-mer probe was digested with the restriction end nuclease 
EcoRI, and the cDNA insert was separated from DNA by electrophoresis 
in Vi Sea Kern GTG Agarose (FMC BioProducts, ME). The- cDNA fragment 
was ligated to EcoRI -digested alkaline phosphatase- treated pUC19 DNA 
(Pharmacia) and transformed into E. coli TGI (Studier ec al.), Trans- 
formants were . selected on plates containing 100 ng/ml of carbenicil- 
lin, 40 Mg/ml of 5 -bromo-4-chloro- 3 - indolyl -^-D-galactoside (X-gal) 
and 1 mM isopropyl-^-D-thiogalactoside (IPTG, Sigma, Sc. Louis, MO). 

recombinant plasmid carrying the cDNA insert was identified and 
designated pS 21. Plasmid pS 21 DNA was subjected to restriction 
endonuclease analysis. The complete nucleotide sequence of both 
strands of the region encoding /9-casein was determined, using T7 
sequencing kit (Pharmacia, Uppsala, Sweden), on double stranded 
templates as described by the vendor. As primers for sequencing 
reacrions, specific oligonucleotides complementary to pUC19 or fi- «• 
casein sequences were used. 

The nucleocide sequence contained an open reading frame sufficient to 
encode the entire anino acid sequence of a ^-casein precursor protein 
20 consisting of 210 amino acids and a signal peptide of 15 amino acids 
(Figure 1) . 

EICAMPLE IB 

I 

Genetic variants of hxunan ^fi-casein 

It is assumed chat human /3-casein exists in a limited number of 
2: genetic variants. These variants will have a number of amino acid 

substitutions as compared to the amino acid sequence deduced from the 
cDNA sequence shown in Figure 1. The asstimption is based on the fact 
that most other species investigated to date do have genetic vari- 
ants, but also on the discrepancies found between the obtained cDNA 
20 sequence (Figure 1) and the sequence determined by classical amino 

acid sequencing (Greenberg et al.). Genetic variants, i.e. analogues 
of the DNA sequence shown in Figure 1, may be isolated and characte- 
rized by the following procedure: 
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DNA is isolated from fresh human milk provided by donors with varying 
genenic background (ethnicity). Similarly. mRNA is isolated from 
fresh milk and cDNA is synthesized by use of the reverse transcrip- 
tase methodology. Using specific synthetic oligonucleotides, selected 
5 from regions flanking sequences with pronounced amino acid discrepan- 
cies, DNA fragments are synthesized by the use of the PGR technique. 
Synthesized DNA fragments are isolated from agarose gels and sequen- 
ced by the dideoxy chain- termination method. 

EXAMPLE IC 

10 Cloning, sequencing and organization of genomic DNA coding for human 
^-casein 

A genomic clone containing the transcribed part of the human ^-casein 
gene is isolated using the cloned human ^-casein described in Example 
lA as a probe. 

15 The probe is the 1075 bp EcoRI fragment derived from pS 21, isolated 
by agarose gel electrophoresis and radioactively labelled by using a 
multiprime DNA labelling system (Amersham) according to Che sup- 
plier's instructions. This labelled probe is used for screening of a 
human genomic library cloned in a lambda vector. The screening proce- 

20 dure is as described in Sambrook et al . , 1989. 

The transcribed region of the human ^-casein gene is expected to be 
in a 7 kb genomic fragment. The overall structure of the gene is 
expected to show more that four exons , probably nine exons , inter- 
rupted by introns located within a 10 kb, probably within the 7 kb 
25 genomic fragment. The cloned genomic fragment is sequenced by using 
the same procedures as described above. 



EJvA-MPLE 1 



Expression of reconbinanc h\iaan ^-casein in Escherichia coll 

The cDNA encoding the pro-polypeptide for human ^-casein was isolated 
as described above as a 1075 bp EcoRI fragment: and cloned into pUC19, 
generating pS 21. 

The cD:."A termini were modified as follows (Figure 2) , pS 21 was 
digested with EcoRI and AccI and a 328 bp fragment was isolated by 
agarose gel electrophoresis. The isolated 328 bp fragment was puri- 
fied from the agarose gel by electroelution and digested with Avail, 
and an Avail and AccI 197 bp fragment was isolated. 

Oligonucleotides with suitable restriction sites and altered codon 
usage adapted for E. coll expression encoding the amino terminus of 
mature p- casein, SYH 132S, S^-M 1329, SYM 1330. SYM 1331. SYM 1332 and 
£YM 1332, were synthesized. The sequences are listed below: 



S'l'M 13 28 5 

SYM 1329 ^ 

S\yi 1330 5 

SYM 13 31 5 

SYM 13 32 5 

S'l'M 13 3 3 5 



-CTCGAGCGAAGAATCGATCACCGAA- 3 ' 
-GAATTCATATGCGTGAAACCATCGAATCCCTGAG- 3 ' 
-TACAAAAAAGTTGAAAAAGTTAAACACGAGGACCAG- 3 ' 
- CTTTTTTGTATTCGGTGATCGATTCTTCGCTCGAGCTCAGGGATT- 3 ' 
-GATCC7GGTCCTCGTGTTTAACTTTTTCAA- 3 ' 
-CGATGGTTTCACGCATATGAATTCTGCA- 3 ' 



The six oligonucleotides described above were annealed and ligated 
into PstI and BamHI cleaved pUC19. and the resulting plasmid was 
sequenced and designated pS 24. An 89 bp PstI and Avail restriction 
fragment was isolated from pS 2^. 

For modification of the 3' end. pS 21 was digested with AccI and a 
6^1 bp fragment was isolated; this fragment was eleccroeluted and re- 
diEested with Bglll, and a 303 bp fragment was isolated. The 303 bp 
fragment was subcloned into AccI and Bglll digested pS 2 (Symbicom) , 
generating pS 22. pS 22 was digested with EcoRI and Bglll. and a 
fragment of approximately 2.96 kb was isolated. 
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In order to ir.trodui;t; suiLable rescriccion sites and m dify the 3' 
untranslated sequence, oligonucleotides were synthesized and ligated 
with the 2.96 kb fragment derived from pS 22. 



The following oligonucleotides were used: 



5 SYM 1335 5 • - CCAGTTCATAACCCCATTAGTGTCTAATAAGGATCCG - 3 ' 

S YM 1336 5 ' - GATCTACCCTGTGACTCAGCCACTTGCC - 3 ' 

SYM 1338 5' -AATTCGGATCCTTATTAGACACTAATGG-3' 

SYM 1339 5 ' - GGTTATGAACTGGGGCAAGTGGCTGAGTCACAGGGTA- 3 ' 



The resulting plasmid was sequenced and designated pS 23. 

10 To obtain the modified fragment encoding the macure ^-casein, the 

following three fragments were ligated: first a P^Cl and AccI frag- 
ment of approximately 3.0 kb derived from pS 23, second the PstI and 
Avail 89 bp fragment derived from pS 24, and third the 197 bp Avail 
and AccI fragment from pS 21, resulting in pS 25. 

15 The ^-casein encoding sequence without signal was isolated from pS 
25 as a 641 bp Ndel and BamHI fragment and introduced into the Ndel 
and BamHI digested vector plasmid pS14, and the resulting ^-casein 
expression vector was designated pS 26 (Figure 3). 

The vector pS 26 carries the bacteriophage T7 (plO promoter and <p 
20 terminator (Studier, 1990) to regulate the expression of ^-casein. It 
also contains the origin of replication and sequences encoding aia- 
piciliin and resistance of pBR322. This vector was analyzed with 
appropriate enzymes and relevant segments were sequenced. 



In order to allow secretion of recombinant human ^-casein, a sequence 
25 encoding the signal peptide of heat stable enterotoxln ZI, STII, of 

ZscherLchia. coli was introduced. 

pS 25 was digested with Aval and EcoRI and a 619 bp fragment encoding 
the major part, except for the amino terminus, of /9-caseln was Iso- 
lated. This fragment was cloned into Ndel and EcoRI cleaved pUC19 
30 together with synthetic olig nucleotides. The synthetic oligonucleo- 



tides encode cne very amino cerninal end of macure ^-casein with 
appropriate restriction sites allowing fusion with the STII signal 
sequence. The following oligonucleotides were used; 

SYK 1^95 5 ' -TATGCACGTGAAACCATCGAATCCCTGAGC- 3 ' 
5 Si'M 15C0 5' -TCGAGCTCAGGGATTCGATGGTTTCACGTGCA-3' 

This three fragment ligation resulted in the plasmid pS 27. This 
plasmid was confirmed by sequence analvsis. 

In the next step pS 27 was cleaved with Ndel and Hindlll and a 700 bp 
fragment was isolated and sequenced. This 700 bp fragment was incro- 

10 duced into the STII containing expression vector pS 29 which was 

digested with Hindlll and Ndel. The resulting expression vector for 
huaan recombinant ^-casein was designated pS 28 (Figure A) . pS 28 
contains the same regulatory elements, replication signals and resis-* 
tance markers as pS 26. The construction was confirmed by restriction 

15 analysis. 

The expression vectors pS 25 and pS 28 were transformed into the 
following Escherichia coli strains: BL21(DE3), BL21(DE3) pLysE and 
BL21(0E3) pLysS (Studier, 1990). The expression experiments were 
carried out essentially as described by Studier et al. , 1990. The 
2C obtained results demonstrated that recombinant human ;5-casein was 
efficiently expressed as a soluble protein with the two different 
expression vectors pS 26 and pS 28 (Figure 5). However, the E. coli 
produced human ^-casein shows a somewhat lower apparent molecular 
weight . 

25 EXAMPLE 3 

Expression of recombinant hximan ^-casein in Saccharomyces cerevisiae 

A fragment encoding the entire ^-casein polypeptide was isolated as a 
698 bp Kpnl and Sail fragment from pS133 (Figure 8, and further 
described in Example 4) by using agarose gel electr phoresis. The 
30 isolated fragment was ligated with Kpnl and Xhol digested pYES2.. 0 (In 
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Vitrogen Corporation, U5a) . The resulting /S-casein expression vector, 
designated pS 232 (Figure 6) , was analyzed by restriction enzyme 
mapping . 

The expression vector contains a polylinker and is designed for high 
5 level expression in 5. cerevisiae . The vector is designed for main- 
tenance in both £. coli and yeast. 

The vector contains the Gal 1 portion of the Gal 1/Gal 10 promoter 
region from S. cerevisiae for inducible expression of genes inserted 
into the polylinker. It also contains the CYCl transcription termina- 

10 tion signal, the 2/i origin of replication and partitioning element 

and the URA 3 gene for selection in yeast. The ampicillin resistance 
gene and the replication origin derived from pUC 19 as well as the T7 
RNA polymerase promoter and Fl origin are present for maintenance and 
selection, transcription and single strand rescue, respectively, in 

15 E. coll. 

The Gal 1 promoter contains the sequences required for the regulation 
of transcription initiation as well as the mRNA start site, but does 
not contain an ATG for translation initiation. Therefore, the trans- 
lation initiation must be located at the inserted sequence. 

20 The expression vector pS 232 was transformed into the Saccharomyces 

cerevisiae host strain UMY 519 (URA" , Gal"*") (A. Bystrom, personal 

_ t 

communication) , and selection was performed in uracil deficient 
medium. Colonies growing at restrictive conditions were isolated and 
analyzed. 

25 A positive colony was cultured in minimal medium lacking uracil and 
containing 2X glucose to a density of about 0.5 00500* The culture 
was then centrifugaced and the pellet was washed with distilled 
water. The washed pellet was divided. One part was resuspended in the 
same medium as above, and the other part was resuspended in the same 

30 medium except that glucose was substituted with 2X galactose. The two 
pellets were resuspended to a density of about 0.1 ODgQQ. The tw 
cultures were grown overnight to a density of about 3.0 ODgQO' 
the cultures were harvested. 



After cencrifugation of the cultures, the pellets w re resuspended 
and the supernatants were mixed with SDS , urea and mercaptoethanol 
containing sample buffer (Laemmli et al. , 1970) and boiled for 20 
minutes. The samples were then loaded on a SDS -PAGE 10-17.5X gradient 
gel (Laemmli et al., 1970). After separation the gels were either 
stained with Coomassie blue (Sigma, St. Louis, USA) or treated for 
protein transfer to nitrocellulose membranes for antibody detection 
using polyclonal rabbit anti-^-casein antibodies affinity purified 
on highly pure native casein coupled to an affinity gel. The obtained 
results demonstrated efficient expression of recombinant human fi- 
casein in yeast. The samples were co-electrophoresed with purified 
native human ^-casein (obtained by acid precipitated human native fi- 
casein further purified by ion exchange chromatography (Rowland, 
1938, Kunz and Lonnerdal, 1990)) and the recombinant molecules mi- 
grated in an identical manner, indicating correct and intact protein ^ 
(Figure 7). 

The expression levels were estimated to be more than 50 mg per litre 
of culture of recombinant human ^-casein. 

EXAMPLE 4 

Expression of recombinant human ^-casein In transgenic mice 

i 

A plasmid containing the murine whey acidic protein, WAF, gene as a 
7.2 kb genomic fragment cloned at EcoRI sites was obtalnea from Dr. 
Lothar Hennighausen (Campbell et al. , 1984). This plasmid was digest- 
ed with EcoRI and Kpnl and separated on agarose gel to obtain the 
approximately 2.5 kb upstream regulatory element. The plasmid was 
also cleaved with Sail and EcoRI and separated on agarose gel, and 
Che fragment containing the third exon, the last intron, the fourth 
exon and downstream positioned mRMA processing signals as an approxi- 
mately 2.3 kb large DNA fragment. These two fragments were cloned 
into a plasmid constructed as a fusion of pUC19 and pUC9 to eliminate 
the restriction sites in the polylinker except for th EcoRI site, 
together with a human cDNA fragment, generating pS 88. 
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In order to facilitate cloning of human /9-casein to the WAP regulato- 
ry elements by addition of Kpnl and Sail sites and to introduce a se- 
quence identical to the murine 5' untranslated upstream of the start 
codon, the original cDNA clone, pS 21, was modified by using polyme- 
5 rase chain reaction. Two PGR primers were synthesized, SYM and 
SYM 2045, with the following sequences: 

SYM 2044 5' -CGGGTACCCTAAAGGACTTGACAGCCATGAAGGTCCTCATCCTCGCCTGCCTGG- 
3' 

SYM 2045 5' -CGGTCGACTTAGACACTAATGGGGTTATGAACTG-S' 



10 The plasmid pS 21 was used as template in a polymerase chain reaction 
with the two primers. The obtained 698 bp fragment encoding human /S- 
casein with a Kpnl site in the 5' end and a Sail site in the 3' end 
and a modified 5' untranslated sequence was then cloned into the 
approximately 7.5 kb Kpnl and Sail digested pS 88 fragment and analy- 

15 zed by DNA sequencing. The resulting expression vector was designated 
pS 133 (Figure 8). The sequence of the Kpnl and Sail PGR generated 
fragment is listed in Figure 9. 



The expression vector pS 133 is capable to mediate stage and tissue 
specific expression of recombinant human ;3-casein in transgenic anl- 
20 mals. The regulatory elements direct gene expression to the mammary 
gland during lactation, thereby allowing isolation of the hetero- 
logous protein from milk. 

Before injection of the expression vector into a pro-nucleus of a 
fertilized egg, pS 133 was digested with EcoRI and the WAP ^-casein 
25 fragment was isolated on agarose gel followed by electroelutlon. The 
eluted DNA was precipitated and redissolved in 10 mM Iris (pH 7.5) 
and 0.1 mM EDTA for microinjection. 

The experimental procedures employed to obtain transgenic animals is 
described in Manipulating the Mouse Embryo: A Laboratory Manual, Gold 
30 Spring Harbor Laboratory Press, 1986. 
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EXAMPLE 5 

Purification of recombinant ^-casein fr m E. coli 

E. coli cells expressing recombinant human ^-casein and prepared as 
described in Example 2 above were separated from the culture by 
5 centrifugation and the resulting pellet was freezed and thawed seve- 
ral times so as to disrupt the cells. Other methods for cell disrup- 
tion includes osmotic shock, pressure change and sonicatlon. After 
an additional centrifugation, the supernatant was collected and 
ammonium sulphate at varying concentration starting from 0.05 M was 
10 added so as to precipitate /9-casein. 

SDS-polyacrylamide gels (iO-17Z) (Figure llA) as well as liquid 
chromatography (Figure 12) were run to analyze purity and Western 
blots (Figure IIB) to analyze identity. Antibodies used in Western 
blots were raised in rabbit using highly purified native human 0- 
15 casein, purified from milk as described in Example 2 above, and puri- 
fied by affinity chromatography on immobilized hximan ^-casein. No 
cross -reactivity was seen with host proteins. 

DEPOSITION 

Plasmid DNA, designated pS 21, pS 26, pS 28, pS 133 and pS 232, has 
20 been deposited in the collection of Deutsche Sanmlung von Hikroorga- 
nismen und Zellkulturen GmbH, Mascheroder Weg lb, D-3300 Braun- 
schweig, Germany, on August 19, 1991 in accordance with the provision 
of the Budapest Treaty and identified there by accession niimbers DSM 
6653, DSM 6654, DSM 6655, DSM 6656, and DSM 6657, respectively. 
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CU^IMS 

L. A DNA sequence encoding a poiypepcide comprising Che amino acid 
sequence shown in Figure 1 or 

an analogue of. said DNA sequence which 

5 1) hybridizes with Che DNA sequence shown in Figure 1 under 

scrineenc hybridlraclon condicions or 

2) encodes a polypeptide, the amino acid sequence of which is at 
least S5Z homologous wlch the amino acid sequence shown in 
Figure 1, or 

10 3) constitutes an effective subsequence of said DNA sequence, 

which analogue encodes a polypeptide having either Che calcium bind- 
ing activity of human ^-casein, or opioid activity, or angiotensin 
converting enzyme (ACE) inhibitory activity, or a combination of any 
two or three of these activities. 

15 2. A DNA sequence which comprises a human ^-casein gene or an analo- 
gue of said DNA sequence which 

1) hybridizes with the DNA sequence shown in Figure i or a 
specific part thereof under stringent hybridization conditions, 

2) encodes a poiypepcide, the amino acid sequence of which is at 
20 lease 85X homologous with the amino acid sequence shown in 

Figure 1 , or 

2) conscicutes an effective subsequence of said DNA sequence, 

-hich analogue encodes a polypepride having either the calcium bind- 
ing activity of human ^-casein, or opioid activity, or angi Censin 
25 converting enzyme (ACE) inhibitory activity, or a combination f any 
two or three of these activities. 
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3. A DNA sequence accoramg co claim L which comprises ch DNA se- 
quence shown in Figure 2 . 

^. A DNA sequence according co any of claims i-3 encoding a polypep- 
tide comprising amino acid sequence L-210 shown in Figure 1. 

5 5. A modified DNA sequence which differs from a DNA sequence as 
defined in any of claims 1-4 in that at lease one nucleotide has 
been deleted, subscituced or modified or ac least one additional 
nucleotide has been inserted so as co result in a DNA sequence which 
encodes a polypeptide having a calcium-binding which is siailar to, 
10 increased or decreased as compared co the calcium-binding activity of 
human ^-casein, or a polypeptide having ACE- inhibitory activity, or 
a polypeptide having opioid activity. 

6. Recombinant human ^-casein comprising a polypeptide having the 
amino acid sequence shown in Figure 1 or a variant thereof having the 

15 calcitua- binding activity of human d-casein or a subsequence of said 
amino acid sequence or variant having ACE- inhibitory activity or 
opioid activity. 

7. A recombinant polypeptide encoded by a DNA seqticnce as claimed in 
any of claims 1-5. 

20 8. A polypeptide according co claim 7 comprising the amino acid 
sequence shown in Figure 1. 

9. A polypeptide according co claim 8 comprising amino acid sequence 
1-210 shown in Figure 1. 

10. A polypeptide according co any of claims 6-9 in which at least 
25 one amino acid residue has been substituted with a different amino 

acid residue and/or in which at least one amino acid residue has been 
deleted or added so as co result in a polypeptide comprising an amino 
acid sequence being different from che amino acid sequence shown in 
Figure 1 and having a similar, increased or decreased calcium -binding 
30 activity as compared to the calcium-binding activity f human ^- 
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casein, or having ACE- inhibitory activity, or navmg opioid activity, 
or having a combination of any two or three of these acclvlcies. 

11. A recombinant polypeptide according to any of claims 6-10 in 
which at least one amino acid residue has been modified by post- 

5 trans lational modification. 

12. .'\ polypeptide according to claim 11 in glycosylated, and/or phos- 
phorylated form. 

13. \ replicable expression vector which carries and is capable of 
mediating the expression of a DNA sequence encoding human ^-casein. 

iO 14. A vector according to claim 13 in which the DNA sequence encoding 
human p- casein is as defined in any of claims 1-5. 

15. .A cell harbouring a vector as defined in any of claims 13 or 14, 

16. A cell according to claim 15 which is a prokaryoclc, a unicellu- 
lar eukaryotic cell or a cell derived from a mulcicellular organism. 

15 17. A method of producing a polypeptide according to any of claims 
5-12, comprising 

inserting a DNA sequence as defined in any of claims 1-5 in a vector 

t 

which is able to replicate in a specific hose cell, introducing the 
resulting recombinant vector into a host cell and growing the result- 
20 ing cell in or on an appropriate culture medium under appropriate 
conditions for expression of the polypeptide and recovering the 

polN-peptide . 

18. .A polypeptide according to any of claims 6-12 produced by a 
method as defined in claim 24. 

25 19. A plasmid selected from the plasmids designated pS 21, pS 26, pS 
2S. pS 133 and pS 232 all of which have been deposited on 19 August, 
1991 with the collection of Deutsche Sammlung v n Mikr organismen und 
Zellkulturen GmbH (DSM) under the accession numbers DSM 6653, DSM 
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66b4. DSM 6655, DSM 6656, and DSM 6657, in accordance with Che provi- 
sions of the Budapest Treaty. 

20. A mammalian expression system comprising a DNA sequence encoding 
human ^-casein inserted into a gene encoding a milk protein of a 

5 non-human mammal so as to form a hybrid gene which is expressible in 
the mammary gland of an adult female of a mammal harbouring said 
hybrid gene so that human )9-casein is produced when Che hybrid gene 
is expressed. 

21. An expression system according to claim 20 in which Che DNA 

10 sequence encoding human ^-casein is a DNA sequence as defined in any 
of claims 1, 2 or 4-7. 

22. An expression system according to claim 20 or 21 in which the 
gene encoding a milk protein is selected from a whey acidic protein 
(WAP) gene. 

23. A hybrid gene comprising a DNA sequence encoding human ^-casein 
inserted into a gene encoding a milk protein of a non.-h\iman mammal, 
the DNA sequence being inserted in the milk protein gene in such a 
manner that it is expressible in the mammary gland of an adult female 
of a mammal harbouring the hybrid gene . 

24. A non-human mammalian cell harbouring an expression system as 
defined in any of claims 20-22. 

25. A mammalian cell according to claim 24 which is an embryo cell. 

26 . .\ method of producing a transgenic non-human maimnal capable of 
expressing human ^-casein, comprising injecting an expression system 
as defined in any of claims 20-22 into a fertilized egg or a cell of 
an embryo of a mammal so as to incorporate the expression system into 
the germline of the mammal and developing the resulting injected fer- 
tilized egg or embryo into an adult female mammal. 
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27. A -echod of producinc a transgenic non-human manunai capable of 
expressing human ^-casein and sufascanciaily incapable of expressing 
5-casein from Che mammal itself, comprising 

1) destroying the mammalian ^-casein expressing capability of the 
mammal so chat substantially no mammalian ^-casein is expressed and 
inserting an expression system as defined in any of claims 20-22 into 
the gcrmline of the mammal in such a manner chat human )8-casein is 
expressed in the mammal and/or 

2) replacing the mammalian ^-casein gene or pare chereof wlch an 
expression system as defined in any of claims 20-22. 



28 . A transgenic non-human mammal 
human ^-casein. 

29. A transgenic non-human mammal 
DNA sequence is a DNA sequence as 

30. A transgenic non-human mammal 
which the DNA sequence is present 



harbouring a DNA sequence encoding 

as claimed in claim 28 in which Che 
defined in any of claims 1-5. 

as claimed in claim 28 or 29 in 
in the germline of the mammal. 



31. A transgenic non-human mammal as claimed in claim 30 in which the 
DNA sequence is present in a milk protein gene of the mammal. 

32. A transgenic non-human mammal prepared by the method of claim 26 
or 27. 

33. A mammal according co any of claims 28-32 which is selected from 
the group consisting of mice, rats, rabbits, sheep, pigs and cattle. 

34. Progeny of a transgenic mammal as defined in any of claims 28-33. 

35. A method of preparing human ^-casein comprising collecting milk 
from a mammal as claimed in any of claims 28-34 and recovering the 
human ^-casein from the milk. 
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36. Milk from a non-human mammal comprising recombinant human fi- 
casein. 

37. Milk from a non-human mammal comprising a polypeptide as defir ;d 
in any of claims 6 -12. 

S 38. .Milk obtained from a transgenic mammal as defined in any or 
claims 28-33. 

39. The use of a polypeptide according to any of claims 6-12 or 18 
as a nutrient supplement. 

40. The use according to claim 39 in which the nutrient supplement is 
10 included in an infant formula. 

41. The use of a polypeptide according to any of claims 6-12 or 18 in 
che preparation of an infant formula. 

42. An infant formula comprising recombinant human ^-casein. 

43. An infant formula comprising a polypeptide as defined in any of 
15 claims 6-12 or 18. 

44 . An infant formula prepared from milk as defined in any of claims 
36-38. 1 

45. A method of isolating a casein protein from a mixture of compo- 
nents comprising 

20 adding ammonium sulfate in a concentration equal to or above 0.05 H 
to the mixture of components so as to precipitate the casein protein, 
and 

separating the precipitated casein protein from the mixture of compo- 
nents . 
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Ud . A mechod of isolacing reccmbinanc human ^-casein from bacterial 
or veasc cells in vhich ic is substantially intracellularly produced 
coiKprising 

separating the cells harbouring the recombinant ^-casein from Che 
culture medium, 

rupturing the separated cells so as to release their content of 
recombinant human ;3-casein, 

optionally removing cell debris from the mixture of ruptured cells, 

and precipitating the released recombinant human /S-casein by addition 
of ammonium sulfate in a concentration equal to or above 0.05 M so as 
to obtain a precipitated ^-casein, and 



isolating the precipitated ^-casein. 

1*1. A method of isolating recombinant human ^-casein from a culture 
of bacterial or yeast cells in vhich it has been substantially excra- 
cellularly produced comprising 

removing the bacterial or yeast cells from the culture mediiun, 

adding ammonium sulfate in a concentration equal Co or above 0.05 M 
so as to obtain a precipitated ^-casein, and 

isolating the precipitated ^-casein. 
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-15 -1 +1 

MKVLIL'-ACLVALALARETI 
cggatgaaggtcctcatcctcgcctgcctgatggctcttgctcttgcaagggagaccata 
1 " 60 



10 20 
ESLS.S SEESIT EYKKVEKVK 
gaaagcctttcaagcagtgaggaatctattacagaatacaagaaagttgagaaggttaaa 

120 

30 40 
HEDQQQGEDEHQDKIYPSFQ 
catgagaaccagcagcaaggagaggatgaacaccaggataaaatctacccctctttccag 
121 180 

50 r - fi- '^-""'^ 60 

PQ„P. LIYPFVEPIPYGFLPQN 
ccacagcctctgatctatccattcgttgaacctatcccctatggttttcttccacaaaac 
181 240 



70 80 
ILPLAQPAVVL PVPQPEIME 
attctgcctcttgctcagcctgctgtggtgctgcctgtccctcagcctgaaataatggaa 
241 > 300 



90 100 
VPKAKDTVYTKGRVMPVLKS 
gtccctaaaqctaaaaacactgtctacactaagggcagagtgatgcctgtcqttaaatct 
301 360 



lie 120 
PTIPFFDPQIPKLTDLENLH 
ccaacgatacccttttttgaccctcaaatcccaaaactcactgatcttgaaaatptgcat 
361 — 



130 140 
LPLPLLQPLMQQVPQPIPQT 
cttcctctgcctctgctccagcccttgatgcagcaggtccctcagcctattcctcagact 
480 



w o 93/04171 



PCr/DK9 1/00233 



2/17 

150 160 
LALPPQPLWSVPQPKVLPIP 
cttgcacttccccctcagcccctgtggtctgttcctcagcccaaagtcctgcctatcccc 
481 



170 180 
QQVVPYPQRAVPVQALLLNQ 
cagcaagtggtgccctaccctcagagagctgtgcctgttcaagcccttctgctcaaccaa 
> 600 

190 200 
ELLLNPTHQIYPVTQPLAPV 
gascttctacttaaccccacccaccagatctaccctgtgactcagccacttgccccagtt 
601 660 

210 

H N P I S V * 
cataaccccattagtgtctaagaagatttcaaagttaattttccctccttatttttgaat 
661 720 

tgactgagactggaaatatgatgccttttccgtctttgtatcacgttaccccaaattaag 
721 780 

tatgtttgaatgagtttatatggaaaaaatgaactttgtccctttatttattttatatat 
781 840 

tatgtcattcatttaatttgaaatttgactcatgaactatttacattttccaaatcttaa 
841 < . 900 

ttcaactagtaccacagaagttcaatactcatttggaaatgctacaaacatatcaaacat 
901 960 

atgtatacaaattgtttctggaattgtgcttatttttatttctttaagaatctatttcct 
961 1020 



ttccagtcatttcaataaattattcttaagcataaaaaaaaaaaa 
1021 1065 
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50 60 



« 



. — r- TCGCC 73CCT GGTGG CTCTr 

titzz — ^ — ^ ^c::: Agcgg acgga ccacc <^<^^ 



CCATG GG.-.TT . 



e: 



ic: 110 12C 



J ;,GA.=.-. GCC7T TCAAG AGG;A TCTAT TACAG AATAZ 

i^rlTJ 5:^:: rc-r cgg;:^ agttc gtcac tcctt agata atgtc ttatg 

ic- 170 180 



T 3 AA - - ■ 



2c: 21C 22: 



GAGAG GATGA ACACC AGGAT 
CTCTG CTACT TG7GG TCCTA 

230 240 



-CAC AGCCr CTGAT CTA-TZ CATTC GTTGA ACCTA TCGCC 
GATAC- GIAAG CAACT TGGAT AGGGC- 



rT CGGTj TwOviA uAw*A 



290 3c: 



sj — - 



C--.- - - -^-W- -.^-rtVJw 

GTAA^ AC*jGA ai'o-wj 



330 



CTGOT GTGGT GCTGO CTG. - 
GACGA CACCA CGACG GACAG 

350 360 



A70GA AG7C0 C'V^A G07A.A AGAw. C7G7C 7ACAC 7AAGG GCAGA 
7GAGG GA777 CGA77 7C7G7 GACAG A7G7v3 A.^C_ CGaC. 



2=: 3s: 40: 



420 



GZTAT ^jjor--" 



77G AC7C7 CAAA7 CCCAA AAC7C 
AAO 7GG3A G777A GGG77 iiSAG 

46: 470 480 



ZZZTZ C7C7G CG7C7 GC770 AGCCO 77GA7 GCAG^ t^^'Z. 

CGAGG 70GGj AAC7A CG7CG 7CCA»j 



AGAA.Z 7777.-. G.-.C37 n^r^nj Uf^Uii- (jvm.vj.-i 

49: 5:: 51: 



1 

53C 540 



C;-"- CA"^" CGC7G TGGTC TG77C CTCAG 

:• A-IU: G7GAA GGGGG AG773 GGGAC ACCAG ACAAG GAGTC 

570 5S0 590 60C 

CZZ^.-. AG70: 7G707 A700: COA.GZ AAG73 G7GCG CTACO C7CAG AGAGC TGTGC CTGT7 
77A--:. A7GG.A 7AGGG GG7C3 77CA0 CACGG GA7GZ- GAG7C 7GTC^ ACACv, GAC~. 



63; 



64: 650 660 



•O rt^OJ^ — 



A.n'-A A-- A.- 7A C77AA CCCCA CCZfi^C CAGA7 CTACO C7GTG 

770 07 7077G A.AGA7 GAA77 GGGG7 GGG7G GTCTA GATGG GACAO 



t = : 65: ''^ 

70A07 CGG7G A-^^OGG G077A AG7A7 7GGGG T^^-- ..wAo w..- 



^ — — — . — . i. A7TAG 7G7G7 AAG7C GAC 
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